A Faster Algorithm for Generalized Multiple-Instance Learning
نویسندگان
چکیده
In our prior work, we introduced a generalization of the multiple-instance learning (MIL) model in which a bag’s label is not based on a single instance’s proximity to a single target point. Rather, a bag is positive if and only if it contains a collection of instances, each near one of a set of target points. This generalized model is much more expressive than the conventional multipleinstance model, and our first algorithm in this model had significantly lower generalization error on several applications when compared to algorithms in the conventional MIL model. However, our learning algorithm for this model required significant time and memory to run. Here we present and empirically evaluate a new algorithm, testing it on data from drug discovery and content-based image retrieval. Our experimental results show that it has the same generalization ability as our previous algorithm, but requires much less computation time and memory.
منابع مشابه
A Faster Algorithm for Generalized Multiple-Instance Learning Qingping Tao
In our prior work, we introduced a generalization of the multiple-instance learning (MIL) model in which a bag’s label is not based on a single instance’s proximity to a single target point. Rather, a bag is positive if and only if it contains a collection of instances, each near one of a set of target points. This generalized model is much more expressive than the conventional multipleinstance...
متن کاملMICCLLR: A Generalized Multiple-Instance Learning Algorithm Using Class Conditional Log Likelihood Ratio
We propose a new generalized multiple-instance learning (MIL) algorithm, MICCLLR (multiple-instance class conditional likelihood ratio), that converts the MI data into a single meta-instance data allowing any propositional classifier to be applied. Experimental results on a wide range of MI data sets show that MICCLLR is competitive with some of the best performing MIL algorithms reported in li...
متن کاملA Genetic Algorithm and a Model for the Resource Constrained Project Scheduling Problem with Multiple Crushable Modes
Abstract: This paper presents an exact model and a genetic algorithm for the multi-mode resource constrained project scheduling problem with generalized precedence relations in which the duration of an activity is determined by the mode selection and the duration reduction (crashing) applied within the selected mode. All resources considered are renewable. The objective is to determine a mode, ...
متن کاملPROTEIN FUNCTION CLASSIFICATION WITH GENERALIZED MULTIPLE INSTANCE LEARNING by Soon-Myung
We apply the generalized multiple-instance learning (GMIL) model and its learning algorithms, GMIL-2, and recently developed kernel-based GMIL, to the protein function classification problem. With the advancement in DNA sequencing techniques , new putative proteins are added to databases much faster rate than they can be tested to determine the function. Thus a fundamental challenge in computat...
متن کاملOn Generalized Multiple-instance Learning
We describe a generalization of the multiple-instance learning model in which a bag’s label is not based on a single instance’s proximity to a single target point. Rather, a bag is positive if and only if it contains a collection of instances, each near one of a set of target points. We list potential applications of this model (robot vision, content-based image retrieval, protein sequence iden...
متن کامل